Data Mining Within DBMS Functionality
نویسنده
چکیده
Data mining slowly evolves from simple discovery of frequent patterns and regularities in large data sets toward interactive, user-oriented, on-demand decision supporting. Since data to be mined is usually located in a database, there is a promising idea of integrating data mining methods into database management systems (DBMS). In this paper we present the results of developing our research prototype for DBMS-integrated data mining. We focus on two main contributions: query language for data mining and constraints-driven algorithm for association rules discovery.
منابع مشابه
Data Mining Support in Database Management Systems
The most popular data mining techniques consist in searching databases for frequently occurring patterns, e.g. association rules, sequential patterns. We argue that in contrast to today's loosely-coupled tools, data mining should be regarded as advanced database querying and supported by Database Management Systems (DBMSs). In this paper we describe our research prototype system, which logicall...
متن کاملThe Drill Down Benchmark
Data Mining places specific requirements on DBMS query performance that cannot be evaluated satisfactorily using existing OLAP benchmarks. The DD Benchmark defined here provides a practical case and yardstick to explore how well a DBMS is able to support Data Mining applications. It was derived from real-life data mining tasks performed by our Data SurveyorTM tool running on a variety of DBMS b...
متن کاملA System Architecture for Database Mining Applications
The problem of enhancing a database management system(DBMS) to support mining applications is twofold. First DBMSs of today have limited functionality for supporting mining applications. Second scaling traditional knowledge discovery techniques for large data sets is not straight forward. Our goal is to propose a system architecture for future DBMSs that incorporate interactive modules for data...
متن کاملParallel Multithreaded Processing for Data Set Summarization on Multicore CPUs
Data mining algorithms should exploit new hardware technologies to accelerate computations. Such goal is difficult to achieve in database management system (DBMS) due to its complex internal subsystems and because data mining numeric computations of large data sets are difficult to optimize. This paper explores taking advantage of existing multithreaded capabilities of multicore CPUs as well as...
متن کاملSIPping from the Data Firehose
When mining large databases, the data extraction problem and the interface between the database and data mining algorithm become important issues. Rather than giving a mining algorithm full access to a database (by extracting to a flat file or other directlyaccessible data structure), we propose the SQL Interface Protocol (SIP), which is a framework for interaction between a mining algorithm an...
متن کامل